Efficient fitting of long-tailed data sets into hyperexponential distributions
نویسندگان
چکیده
We propose a new technique for fitting long-tailed data sets into hyperexponential distributions. The approach partitions the data set in a divide and conquer fashion and uses the Expectation-Maximization (EM) algorithm to fit the data of each partition into a hyperexponential distribution. The fitting results of all partitions are combined to generate the fitting for the entire data set. The new method is accurate and efficient and allows one to apply existing analytic tools to analyze the behavior of queueing systems that operate under workloads that exhibit long-tail behavior, such as queues in Internet-related systems.
منابع مشابه
Efficient fitting of long-tailed data sets into PH distributions ?
We propose a new technique for fitting long-tailed data sets into phase-type (PH) distributions. This technique fits data sets with non-monotone densities into a mixture of Erlang and hyperexponential distributions, and data sets with complete monotone densities into hyperexponential distributions. The method partitions the data set in a divide and conquer fashion and uses the Expectation-Maxim...
متن کاملSkew-slash distribution and its application in topics regression
In many issues of statistical modeling, the common assumption is that observations are normally distributed. In many real data applications, however, the true distribution is deviated from the normal. Thus, the main concern of most recent studies on analyzing data is to construct and the use of alternative distributions. In this regard, new classes of distributions such as slash and skew-sla...
متن کاملModeling and Analysis of Heavy-tailed Distributions via Classical Teletraac Methods
We propose a new methodology for modeling and analyzing heavy-tailed distributions, such as the Pareto distribution, in communication networks. The basis of our approach is a tting algorithm which approximates a heavy-tailed distribution by a hyperexponential distribution. This algorithm possesses several key properties. First, the approximation can be achieved within any desired degree of accu...
متن کاملFitting Mixtures of Exponentials to Long-Tail Distributions to Analyze Network Performance Models
Traffic measurements from communication networks have shown that many quantities charecterizing network performance have long-tail probability distributions, i.e., with tails that decay more slowly than exponentially. File lengths, call holding times, scene lengths in MPEG video streams, and intervals between connection requests in Internet traffic all have been found to have long-tail distribu...
متن کاملA Hyperexponential Approximation to Finite-Time and Infinite-Time Ruin Probabilities of Compound Poisson Processes
This article considers the problem of evaluating infinite-time (or finite-time) ruin probability under a given compound Poisson surplus process by approximating the claim size distribution by a finite mixture exponential, say Hyperexponential, distribution. It restates the infinite-time (or finite-time) ruin probability as a solvable ordinary differential equation (or a partial differential equ...
متن کامل